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I^ , Abstract 

H ' In this paper we give a construction of cut sparsifiers of Bencziir and Karger in the dynamic streaming 

setting in a single pass over the data stream. Previous constructions either required muhiple passes or 
were unable to handle edge deletions. We use 0(l/e^) time for each stream update and 0{n/e'^) time to 
fvq , construct a sparsifier Our e-sparsifiers have 0{n log n/e^) edges. The main tools behind our result are an 

pv) ■ application of sketching techniques of Ahn et al. [SODA' 12] to estimate edge connectivity together with a 

novel application of sampling with limited independence and sparse recovery to produce the edges of the 
ry^ ■ sparsifier. 

q 

^ : 1 Introduction 
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We study the problem of graph sparsification on dynamic graph streams. Graph sparsification was introduced 
by Bencziir and Karger ||BK96i . who gave a near linear time procedure that takes as input an undirected graph 

O ■ G on n vertices and constructs a weighted subgraph H of G with 0{nlogn/e^) edges such that the value 

^ , of every cut in H is within a 1 ± e factor of the value of the corresponding cut in G. This algorithm has 

^ I subsequently been used to speed up algorithms for a host of applications involving cuts and flows such as 

finding approximately minimum or sparsest cuts in graphs ([BK96, KRV09I1 ) as well as other applications 

Q . (e.g. IIKL02I ). Spielman and Teng introduced a stronger class of sparsifiers called spectral sparsifiers [IST081 . 

^ I Subsequent work has developed a number of efficient algorithms for constructing cut and spectral sparsifiers 

^ ' llBK96l[SS08llBSS09[[KMSTT0llFHHPllL[KPT2l] . 

The algorithms developed in IIBK96[[SS08llFHHPllllKP12ll take near-linear time in the size of the graph 

S^ I and produce very high quality sparsifiers, but require random access to the edges of the input graph G, which 

is often prohibitively expensive in applications involving massive data sets. The streaming model of compu- 
tation, which restricts algorithms to use a small number of passes over the input and space polylogarithmic 
in the size of the input, has been studied extensively in various application domains — see |Mut06| for an 
overview — but has proven too restrictive for even the simplest graph algorithms. Even testing s — t connec- 
tivity requires Q{n) space IIHRR99I . The less restrictive semi-streaming model, in which the algorithm is 



S 



restricted to use 0{n) space, is more suited for graph algorithms IIFKM "'"051 IMcG09l . 

The problem of constructing graph sparsifiers in the semi-streaming model was first considered by Ahn 
and Guha IIAG09I . who gave a one-pass algorithm for finding Bencziir-Karger type sparsifiers with a slightly 
larger number of edges than the original Bencziir-Karger algorithm, 0(7T,logrilog — /e^) as opposed to 
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0(?i log n/e^) using 0{n) space. Subsequently, HKLllI obtained an algorithm for constructing stronger spec- 
tral sparsifiers of size 0(n log n/e^) in a single pass in the streaming model. All of these algorithms work 
only in the incremental model, where edges can be added to the graph but not removed. 

In a recent paper IIAGM121 Ahn, Guha and McGregor introduced a beautiful graph sketching approach to 
streaming computations in dynamic streams, i.e. allowing both edge additions and deletions. They showed that 
connectivity can be determined in 0{n) space in this setting, and gave a multi-pass algorithm for obtaining cut 
sparsifiers in small space. Their techniques center around the use of linear sketches, which have been heavily 
studied in the field of compressed sensing/sparse recovery originating in IICRT06[|Don06ll . See BGIIOI for a 
survey. The focus of this paper is to provide a single-pass implementation of cut sparsification on dynamic 
streams in the semi-streaming model. 

Our results: Our main result is an algorithm for constructing cut sparsifiers in a single pass in dynamic 
streams with edge deletions. We prove 

Theorem 1. There exists a single-pass streaming algorithm for constructing an e-cut sparsifier of an un- 
weighted, undirected graph G = {V, E) with n vertices and m edges in the dynamic model using 0{n/e^) 
space. The size of the sparsifier is 0(n log n/e^), and the runtime of the algorithm is 0(l/e^) per update. 
At each point in the stream we can recover the edges of the sparsifier in time 0{n/e^). 

Our sparsification algorithm works by sampling edges at a rate inversely proportional to their edge con- 
nectivity, which was shown to work in HFHHPlll . In order to do this we maintain two sets of data structures. 
The first estimates connectivities, and the second does the actual sampling. We estimate connectivities by 
sampling edges of the input graph at a geometric sequence of sampling rates and recovering connected com- 
ponents of these samples using a result of IIAGM12i The second set of data structures stores a linear sketch 
of the actual samples we use in our sparsifier, also sampling at a geometric sequence of rates. Using sparse 
recovery and the linearity of our sketch we are able to reconstruct the necessary samples when needed. 

Organization: We start by giving preliminaries on graph sparsification in Section |2l We then describe the 
algorithm in Section [3] and Section H] Maintaining our samples in the dynamic model requires knowing, for 
each edge in the graph, whether or not it was included in each sample. This can be easily achieved if we 
assume that our algorithm has access to G(n^) independent random bits, which, however, is not feasible 
in 0{n) space. For simplicity of presentation, we first describe our algorithm assuming that it has access to 
G)(n^) random bits in Section|3] We show how to obtain sufficiently good estimates of edge connectivities in a 
single pass, as well as recover the edges of a sparsifier using sparse recovery techniques. In Section|4]we show 
how to remove the assumption that the algorithm has access to 0(n^) independent random bits using random 
variables with limited independence, obtaining a 0(n/e^) space single-pass algorithm for sparsification in the 
dynamic model. 

2 Sparsification preliminaries 

We will denote by G{V, E) the undirected input graph with vertex set V and edge set E with \V\ = n and 
\E\ = m. For our purposes G will be unweighted, but the results in this section apply to weighted graphs. 
For any e > 0, we say that a weighted graph G'{V, E') is an e-sparsification of G if the weight of every cut in 
G' is within (1 ± e) of the corresponding cut in G. 

Sparsification algorithms work by sampling edges with probabilities inversely proportional to some mea- 
sure of connectivity. The simplest of these is edge-connectivity: 

Definition 2. A graph G is fc-connected if the value of each cut in G is at least k, and an edge e has edge- 
connectivity Cg ifce is the value of the minimum cut separating its endpoints. 
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Graphs with high fc-connectivity are particularly simple to sample: 

Theorem 3 ( IIKar99i ). Let G = (F, E) be a k-connected graph on n nodes, and let G' be obtained from G 
by sampling edges independently with probability p = ©(log n/[e^k)), and giving sampled edges weight \/p. 
Then G' is an e-sparsifier of G with high probability. 

The Bencziir-Karger algorithm samples according to a more strict notion of connectivity, referred to as 
strong connectivity, defined as follows: 

Definition 4 ( IIBK96II ). A A;-strong component is a maximal k-connected vertex-induced subgraph. The strong 
connectivity of an edge e, denoted by Se, is the largest k such that a k-strong component contains e, and we 
say e is /c-strong if its strong connectivity is k or more, and fc-weak otherwise. 

The following two lemmas will be useful in our analysis: 

Lemma 5 ( IIBK96I ). The number ofk-weak edges in a graph on n vertices is bounded by k{n — 1). 

Lemma 6 ( l|BK96n ). Let G = (V, E) denote an undirected graph on n nodes. For an edge e ^ E let Sg 
denote the strong connectivity ofe. Then ^^^^ 1/se < n — 1. 

We also rely on Bencziir and Karger's main result, which is as follows: 

Tiieorem 7 ( ||BK96n ). Let G' be obtained by sampling edges ofG with probability p^ = min{/9/(e^Se), 1}, 
where p = 16{d + 2) Inn, and giving each sampled edge weight 1/pe- Then G' is an e-sparsification of G 
with probability at least 1 — n~'^. Moreover, the expected number of edges in G' is O(nlogn). 

It follows easily from the proof of Theorem |7]in IIBK96II that if we over-sample by using an underestimate 
of edge strengths, the resulting graph is still an e-sparsification. 

Corollary 8. Let G' be obtained by sampling each edge of G with probability p^ > pe and and give every 
sampled edge e weight l/pe. Then G' is an e-sparsification ofG with probability at least 1 — n~'^. 

Recently Fung et al. BFHHPllH proved that a more aggressive sampling method, namely sampling using 
edge connectivities as opposed to strong connectivities, also produces cut sparsifiers, and we will also require 
this result. 

Theorem 9 dlFKHPllH ). Let G' be obtained from a weighted graph G by independently sampling edge e with 
probability pe = p/ce, where p = 0(log^ n/e^). Then, G' contains 0{n log^ n/e^) edges in expectation and 
is an e-sparsification whp. 

3 Sparsification with free randomness 

In this section we present a dynamic spai^sifier under the assumption that the algorithm has access to ©(n^) 
random words. We will remove this assumption in Section |4] 

Our input graph G is undirected and unweighted. As in IIAGM12II . we will use the following representation 
ofG: 

Definition 10. Given an unweighted graph G = {V, E), let Aq be the n x (2) matrix with entry {u, {v, w)) € 

[n] X ('2 ) and v < w given by 



^u,{v,w) 



1 ifu = V and {v, w) £ E 
— 1 ifu = w and {v, w) £ E 
otherwise 



Updates to the graph G in the form of the addition or deletion of an edge arrive one at a time in a streaming 
fashion. An update cannot delete an edge that does not exist or add one that already does, but other than these 
restrictions the order is adversarial, and the stream can be arbitrarily long. We need to maintain a data structure 
using only 0{n) space that allows us to efficiently construct an e-sparsifier of the current graph G after any 
sequence of updates. We will accomplish this using a collection of linear sketches of the rows of Aq- 

Our algorithm has two components: the first will maintain an estimate of the connectivity of each edge 
and therefore its sampling rate (as discussed in Section |2]i, and the second will store the actual samples. The 
former uses the tools developed by Ahn et a/. IIAGM12ll . and the latter is based on the technique of sparse 
recovery developed in the sketching and compressed sensing literature HGIlOi 

Before delving into the details, we elaborate on our use of randomness. In this section we assume that 
for each pair {u,v) € [n]^ the algorithm has access to a uniformly random number /i(„„) € [0,1]. In 
fact, we will need O(logn) independent copies of these random numbers, which we will denote by h^^_^y 
6 = 1,..., 0(log n). These random variables will be used to estimate sampling rates for edges of G. We will 
also assume access to independent random numbers ^^ s G [0, 1], 6 = 1, . . . , 0(log n), which we will use 
to determine a partition of the vertex set needed for sampling. Finally, we also assume access to independent 
random numbers ^rf . G [0, 1] which will be used to sample edges of the sparsifier. Once g, g* and h are 
sampled, they are fixed for the duration of the algorithm. This is important for handling deletions, as it ensures 
that edges can be removed from exactly those sketches to which they have been added. 

It will be convenient to think of all these numbers as independent in this section, even though this is not 
feasible in subquadratic space in the semi-streaming model. In Section |4] we will show that using numbers 
that are only 0(l)-wise independent for fixed u and independent for different v is sufficient, leading to a 
space-efficient solution. Some of our subroutines will also require their own internal entropy, but the total 
used will be only 0{n) words. 

3.1 Estimating edge connectivity 

The building block of our connectivity estimates is the following result from IIAGM12II that finds connected 
components by sketching the rows of Aq- 

Theorem 11 ( IIAGM12II ). There is a single-pass, linear sketch-based algorithm supporting edge additions 
and deletions that uses 0{n log n) space and returns a spanning forest of the graph with high probability. 

Our sketch is simple. We consider random samples of the input graph at geometric sampling rates and find 
connected components in each sample. Specifically, for each a = 1, . . . , 0(log n) denote by G\ = {V, E^) a 
subsample of the edges of G obtained by setting 

E', = [{u,v) e E : min{hl^^^,hl^^^} < 2-"} . (1) 

We maintain connectivity data structures C^ for each of the subgraphs G^ for a = 1, . . . , O(logn), b = 
1, . . . , 0(log n) using Theorem [TT] It is important to note that use of the functions /!(„ t,) for sampling rather 
than fresh randomness allows us to handle deletions properly by deleting a removed edge only from the 
sketches that we added it to by simply sampling and using the consistent samples as input to the sketch in 
Theorem [TT] 

Remark 12. Note that an edge {u, v) is included in E^ if the minimum ofh, . and h, ^n is smaller than 2~°. 
This will be important for the proof of correctness for hash functions with limited independence in Section [?] 

The connectivity structure of the subgraphs G\ allows us to associate a sampling rate with each edge. For 
an edge {u, v) £ E we use the smallest sampling rate 2~" at which u and v are still in the same component 



as an estimate of the sampling rate for {u,v). Independent repetition O(logn) times reduces the variance 
sufficiently to get precise estimates. 

Define Va as the partition of vertices in V induced by the intersection of all partitions C^, 6 = 1, . . . , 0(log n) 
That is, vertices u and v are in the same connected component in Va if and only if they are connected in C^ 
for all 6 = 1, . . . , 0(log n). For an edge (u, v) let L{u, v) — the level of (n, v) — denote the largest a such that 
u and V are in the same component in Va, and for a vertex v let the level L{v) denote the largest a such that v 
is not a singleton in Va- 

The level L{e) of an edge serves as a proxy for its connectivity: 

Lemma 13. For all edges e ^ E one has 

e(se/logn) <2^(") <2Ce 
with high probability, where Se denotes strong connectivity and Cg denotes edge connectivity. 

Proof. The first inequality follows from the fact that 0(logn) • 2^-strongly connected components will stay 
connected with high probability by Theorem |3] when the functions h!^, ^^ used for sampling are truly random. 
Lemma|26]from Section |4]gives the result for hash functions with limited independence. 

The second inequality follows by noting that if there is a cut separating u and v of size at most 2^/2, then 
it will be empty with probability at least 1/2 when we sample at rate 2~^ by Markov's inequality, so u and v 
will get disconnected in one of the 0(log n) independent repetitions with high probability. D 

This implies the levels can be used as sampling rates: 

Lemma 14. Sampling edges independently at rate pe = O (log n/(e^2 '^')) and weighting sampled edges 
with 1/pe produces a sparsifier with 0{n log^ n/e^) edges high probability. 

Proof. By Theorem |9] sampling at rate 0(log^ n/(e^Ce)) works. By Lemma [13] 1/2^^'^) > l/(2ce), and 
oversampling only improves concentration. This proves the statement assuming that the sampling of edges is 
independent. 

The expected size of the sample is bounded by Ylie<^EPe = 0(log^ n/e^) ■ Ylie<^E ^/'^e = 0{n\o^ n/e^), 
where we used Lemma [T3]to bound pe = 0(log n)/se and the fact that X^gg^; l/sg < n — 1 by Lemma[6l D 

3.2 Maintaining edge samples 

We now show how to maintain small space sketches that will allow us to reconstruct the edges of the sparsifier. 
Our basic tool is the technique of sparse recovery from the field of compressed sensing HGIlOi A vector A of 
dimension N is /c-sparse if it has at most k non-zero entries, and a fc-sparse, or approximately fc-sparse, signal 
can be recovered with high probability from 0{k \og{N/k)) non-adaptive linear measurements. Here we use 
the following result of Cormode and Muthukrishnan IICM061 that allows recovery in 0{k) time at the cost of 
slightly sub-optimal sketch size and error guarantees: 

Theorem 15 ( IICM06II ). We can construct a randomized 0/1 matrix T of dimension 0{ck log n/e^) x N such 
that for any k-sparse signal A of dimension N, given the transformation TA we can reconstruct A exactly 
with probability at least 1 — n~'^ in time 0(c^/clog n/e^). The matrix T is constructed using 0{l)-wise 
independent hash functions, and individual entries can be queried efficiently. 

We will also need the following result by Indyk IllndOOII on sketching li norms: 

Theorem 16 ( lUndOOII ). There is a linear sketch-based algorithm using 0{c\og N/e^) space and O {clog N/e^) 
random bits that can estimate the ii norm of a vector of dimension N to within a factor of (1 it e) with prob- 
ability 1 — N~'^. The sketch can be updated in 0(log N) time. 



For a row v of the matrix Ac let ^^(i;) for r = 1, . . . , 0(log ?i) denote linear sketches guaranteed by 
Theorem [15] for k = 0(log^n/e^) where a corresponds to the geometric sequence of sampling rates, and 
r = 1, . . . , 0(log n) are independent copies that are useful for recovery. More precisely, S^g^{v) is a sketch of 
row V in the matrix Aa' where G' ^ has edges 



a,r 



K,r = {{u,v)^E: min{5[, ,) , ff[,^„) } < 2"'^ } . (2) 

Some of these sketches may accumulate more than k edges and consequently cannot be decoded on their own, 
but we will prove this is not an issue. We will also need sketches (f^{v),r = 1, . . . , 0(log n) for the ^i-nonii 
of the w-th row of the matrix An' ■ Note row v of Aa' contains ail entry for each edge incident on v in 
G^ ^, SO the £i norm corresponds exactly to the degree of v in the sampled graph Gjj ^. 

Remark 17. Note that we are using 0(log n) independent samples G'^^for each sampling rate a. This will 
be important in the proof of Lemma \21\below. 

Finally, we will also need another set of independent samples of G that will be used to obtain the edges of 
the sparsifier. Let G* be the graph with edges 

El = [{u,v) G E : min{5(*„,,),5(V)} < 2^"} • (3) 

For each node v and sampling rate a we maintain sketches S*{v) of row v in the matrix Aq^ using Theorem [TS] 
for k = 0(log^ n/e^). Here we do not need independent repetitions for each sampling rate a. 

By the choice of the matrix Aq and the linearity of the sketches, if 5 C 1/ is a cut then Ylv&s ^a(^) i^ ^ 
sketch for the size of the cut and "^^iv&s ^ai'^) i^ sketch of a sample of its edges. If u is a supemode obtained 
by contracting a set of vertices S, we write d^(t;) to denote ^^^5 ^a(^) ^^^ similarly for 5^(w) and S*{v). 
For any fixed cut and fixed G'^ the estimate given by (ij^(f ) is close to expectation with high probability. 

Before specifying the algorithm formally, we give the intuition behind it. Recall that for every vertex u at 
level L{u) = a, we need to sample edges going from u to vertices in u's component in Va with probability 
7 log^ n/(e^2") for an appropriate constant 7. In order to do that, we will sample all edges incident on u with 
probability 7 log^ n/(e^2") and then throw away the ones that do not go to u's component. In order to obtain 
such a sample, we will use the sketches S^, (u) that were made with sampling at rate 7 log^ n/(e^2"). 

The main observation here is that if we contract connected components in Va+i into supemodes, the 
resulting graph will have only (7logn • 2"+^)-weak edges for a constant 7, so the average degree will be no 
larger than 7logn • 2"^^. By repeatedly removing vertices with degree at most twice the average, nodes of 
this subgraph can be partitioned into sets Wi, . . . ,Wt such that for each i = 1, . . . ,t and u € VFj the degree 
of u in T4^j U • • • U Wi is at most 47 log n2" and t = 0(Iog n). Formally, 

Lemma 18. Suppose all edges in G are k-weak. Then V can be partitioned into t = log n sets Wi , . . . ,Wt 
such that for all v £ Wi the degree ofv when restricted to Wi U • • • U Wt is at most 2k. 

Proof. By Lemma[5]if G has n' nodes then it has at most k{n' — 1) edges. Let Wi be the set of all nodes with 
degree at most 2k. By Markov's inequality Wi includes at least half the nodes. After removing Wi and all 
incident edges we can repeat this process to find W2, etc. At each iteration we remove at least half the nodes, 
so it temiinates in log n iterations. D 

This partition cannot actually be computed because we cannot properly update the degree sketches after 
removing Wi , but its existence allows us to prove that the same procedure works when using the lower degree 
sample G'^,,,. 

Let A = log(7 log^ n/e^). We need to use the samples S1_^{u) for edges at level a. We first bound the 
degrees in G;„^: 



Lemma 19. Let g"^, ^ be 0(log 7i /e'^)-wise independent for fixed u, and independent for different u. Then 

the degree of all u ^ Wi in G'^_^ ^ restricted to nodes Wi yj .. .VJWt is at most 0(log'^ n/e^) with high 
probability. 

Proof. Consider a vertex u € Wi and let N{u) denote the neighbors of u in l^j U . . . U W( in the full graph 
G. The size of its sampled neighborhood is bounded by 

veN{u) ' veN{u) 

The number of terms is 0{logn2°-). The second sum consists of independent random variables, so stan- 
dard Chemoff bounds apply. Concentration bounds from Theorem [25] apply to the first sum since they are 
sufficiently independent for expectation 0(log^ n/e^). D 

Observe that if a node u satisfies the bound in Lemma[T9l then the sketch S^_^{u) can be decoded using 
Theorem [15] Let Ui be the set of decodable nodes. We would like to argue that we can simply output a 
decoded edge {u, v) as part of the sparsifier if and only if (/L ^-i < 7 log^ n/(e^2°) and v belongs to the same 
connected component as uinVa. Then, using the linearity of the sketches, we could subtract decoded edges 
of the form {u,v),u G Ui,v G V \ Ui from the sketches S{v) and d{v), effectively removing Ui from the 
graph, and then move on to U2. 

However, for technical reasons to avoid dependencies and ensure the algorithm works in small space, we 
cannot reuse the variables gj_^ .. We need to calculate U2 using the independent sketches 5''"'"^ and d^'^^ as 
opposed to S"" and d"' to avoid dependencies and also use the variables </? , for the actual samples. The size 
of r will remain bounded since we will prove the process terminates in 0(log n) steps, but switching to S"""*"^ 
introduces additional complications because to remove a vertex u G Ui from the graph, we must be able to 
recover the edges from Sl^]^{u), . . . , S^_^^ (u). The following lemma accomplishes this: 

Lemma 20. Let ffL ^a be Q{log n/e'^)-wise independent for fixed u, and independent for different u, and 
suppose the degree ofu in G'^_^ ^ is at most a log n/e^ where a is the constant from Lemma [79l Then the 
degree ofu in G'^_^ ,^, is 0(log n/e^)for all r' > r with high probability. 

Proof. If the degree of u in G'^_^ ^ is at most a log'^ n/e^, we expect the degree in G to be at most (a/7) log n2", 
and concentration inequalities for sampling with limited independence (Theorem[25]l show that with high prob- 
ability its degree is at most, say, 2(0/7) log ^^2". Applying Theorem l25]again shows u's degree in G'^„^ ^, is 
0(log^ n/e^) for any r', and taking a union bound over all r' finishes the proof. D 

We now state the algorithm formally. For each a = 1, . . . , O(logn) we denote the graph obtained by 
contracting all connected components in Va+i into supernodes by Ha. 

Algorithm 1: PARTITION(a) 



Let Hi ^ Ha. 

for r ^ 1 to 0(log n) do 

Estimate the degree of each v G HJ^ from sketches d^a-Ai''^) 

U: ^{ve Hl\dl_^{v) < 4alog'n/e^} 

for u G Ua, j = r + 1, . . . , 0(log n) do 
Run sparse recovery on Sl^_^{u) 

For all edges (n, v) recovered from Sl^_^{u), subtract {u, v) from 5'^_^(t') and d-'^_^{v) 

end for 

i7^-+i ^ HI \ UI 
end for 
return {f/I}r=i,...,o{iogn) 



Here 7 is a constant such that sampling the edges of a fc-connected graph at rate 7 log n/k produces a 
connected subgraph with probability at least 1 — n~^^, and a is a constant bounding the degree in Lemma [191 
We first prove 

Lemma 21. For all a = 1, . . . , 0(log n), Algorithm [7] recovers a partition of Ha such that for all r = 
1, . . . , 0{logn)for each u G C/^' the degree ofu in U^+^ U . . . U [/f ('°S") in graph G*_^ is 0(log^ n/e^) 
with high probability. 

Proof. We first prove that the constructed sets cover all of Ha. Consider the set Ui. By Lemma [T9l all 
u E Wi have degree 0{log^ n/e^) in G'^_^ ^ with high probability, so removing all nodes with degree at 
most Aa log"^ n/e^ for lai^ge enough a will include all u G Wi. Lemma |20] implies that the sparse recovery in 
line[6]will succeed for all j with high probability, so we can completely remove C/^ from the graph. Lemmal20l 
also bounds the degree of u G U^ in graph G*_^, by replacing G'^_^ ^,, with G*_^ in the statement of the 
lemma. 

We now note that the identity of the set C/^ is independent of the randomness used for samples and 
sketches S-'^_^, d-'^_^, j = r + 1, . . . , 0(log n). Thus, the same bounds on node degrees follow by a recursive 
application of the argument to Ha \ Ua. Furthermore, it follows by induction on r that after removing 
Ua, ■ ■ ■ ^Ua^e. have removed all of Wi, . . . , Wr with high probability, so the algorithm terminates in 0(log n) 
iterations. D 

Given the partition Ua, ■ ■ ■ ,Ua ^ °^ "' , the algorithm for recovering the edges of the spai^sifier is as follows: 
Algorithm 2: RECOVER(a) 



for r = 1, . . . , 0(log n), u G U'a do 

Run sparse recovery on 5'*_^(u) 

Output each recovered edge (u, v),u & Ur only if (7? ^^ < 7 log^ n/{e'^2°-) and L{u, v) = a. 

Subtract recovered edges from S*_^{v) for all v G Ua,r+i U . . . f^a,o(iogn)- 
end for 



We can now prove 

Theorem 22. For each v G V{G) Algorithm\2\ recovers a sample of edges incident on v, where edges are 
picked with probability 7 log n/(e^2 '^•' ). 



Proof. Note that sparse recovery succeeds whp by the degree bound in Lemma 121] Finally, note that the 
structure of the partition UiU . . . UUr maps to each edge a single random variable g{u,v)^ so the probability 
of an edge being sampled is correct. D 

We will need the following definition in Section ID 

Definition 23. An edge e = {u,v) a E is controlled by a vertex u ^V ife is sampled using gJ .. We denote 
the set of edges controlled by u by E^- 

We will also need 

Lemma 24. Let E* be a set of edges. For each u €V one has E[\E* n £'„[] = O(log^ n/e^). 

Proof. Consider a vertex n G F. By Lemma |2T] u controls 0(log^ ?i2"/e^) edges at level a. Hence, the 
expected number of edges sampled at each level a is 0(log^ n/e^). Hence, the expected number of edges 
controlled by u across all levels is 0(log'^ n/e^). D 



3.3 Runtime 

We now briefly summarize the time required to update the sketches and to construct a sparsifier. We do 
not optimize the log factors but only show updates require 0(l/e^) time and building a sparsifier requires 
0{n/e'^). Each addition of deletion of an edge requires updating C^, S^ and (F^ for a,b,r < 0(log n). The 
sketches C^ are built using £o-samplers (see IIAGM12II ) and can be updated in 0(1) time. For an edge (n, v) 
we update all O(logn) copies of S{u), S{v), d{u) and d{v). By Theorems [15] and [T6l these can each be 
updated in 0(l/e^) time. For S{u) this is done by querying only the 0(l/e^) entries of the matrix T we 
need. 

Construction of the sparsifier is more expensive. If we query the sparsifier after each graph update it may 
need to be recomputed from scratch each time due to edge deletions, so we cannot amortize its cost across 
the updates. However, we will show it requires only 0{n/€^) time. Building all O(logn) Va requires 0{n) 
operations each if £o-sampling is done efficiently. 

Running one iteration of Algorithm [Urequires 0{n) estimations of d, 0{n) decodings of 5, 0{k) updates 
to 5 and d for each of the 0{n) sparse recoveries and 0{n) additional bookkeeping. Since S is 0(l/e^)- 
sparse by Theorem [T5] decoding takes 0(l/e^) time. Summing over 0(1) values of r and a, we use a total of 
0{n/e^) time. Algorithmic] also does 0{n) spai^se recoveries and 0{n/€^) updates to S* per iteration, which 
also totals to 0{n/e^) summing over all r and a. 

3.4 Weighted graphs 

We note that even though we stated the algorithm for unweighted graphs, the following simple reduction 
yields a single pass dynamic sparsifier for weighted graphs, as long as when an edge is removed or updated, 
its weight is given together with the identity of its endpoints. Suppose that edge weights are integers between 
1 and W (the general case can be reduced to this one with appropriate scaling and rounding). Consider graphs 
Go, . . . , Giogj VK> where an edge e = {u, v) belongs to the edge set of Gh iff the binary expansion of We has 1 
in position 6, for 6 = 0, ... , log2 W . Note that in order to presei^ve cuts in G to a multiplicative factor of 1 ± e, 
it is sufficient to preserve cuts in each of Go , . . . , Gb to the same factor. To do that, it is sufficient to maintain 
log2 W copies of our algorithm operating on the graphs G;, (this is feasible due to the assumption that edges 
aie either added or completely removed, i.e. the weight of the removed edge is given at the time of removal). 
The space used and the number of edges in the sparsifier will both increase by a factor of log2 W. 

4 Sparsification with limited independence 

In this section we remove the assumption that the algorithm has access to 0(n^) bits of randomness by using 
sampling with limited independence. We prove the following two statements. First, we show in Lemma |26] 
that sampling edges of a /c-connected graph at rate 7 log n/k yields a connected graph with high probability, 
even with limited independence. In particular, it is sufficient to ensure that random variables used for sampling 
edges incident to any given vertex are only 0(l/e^)-wise independent. This lemma is used in Section [3]to 
show that our estimation of sampling rates is accurate. We then show that our algorithm for constructing a 
sparsifier by sampling at rates proportional to edge connectivities yields a sparsifier with high probability even 
when the sampling is done using limited independence. We note that the second claim does not subsume the 
first due an extra log n factor that is needed for sampling with edge connectivities to go through. 
We will use tail bounds for t-wise independent random variables proved in IISSS951 . Theorem 5: 

Theorem 25. Let Xi , . . . , X^ be random variables each of which is confined to [0, 1]. Let X = ^^11=1 ^j > A' = 
^[X]. Letp = [ijn, and suppose that p < 1/2. Then if Xi are \e^~\-wise independent, then 

Pr[|X-/i|>e^]<e-L^''^/3J, 



ife<\, and 

Vr[\X - ^1 > e^] < e-^i'^^i+^W^ ^ g-e^/s 

We now prove 

Lemma 26. Let G = (Vi -E) be a k-connected graph on n nodes. For edges e = {u,v) (z E let random 
numbers h^^v £ [0,1] be such that 

1. hu,v i^ independent ofhu'y far all u' ^ u; 

2. hu,v ore [47 log n] -w/5'e independent far fixed u, where 7 > 20. 

Also, let Xu^y be 0/1 random variables such that Xu^v = 1 if hu,v ^ {'ylogn)/k and otherwise. If G' is 
obtained by including each edge (n, v) & E such that X^ „ = 1 or Xy^^ = 1. then G' is connected whp. 

Proof. First, for each e = {u,v) G E let Xu,v denote 0/1 random variables such that Xu,v = 1 if hu,v < 
(7 log n)/se, where Sg is the strong connectivity of e. Define G" as the graph obtained by including each edge 
{u, v) G E such that Xu,v = 1 or Xv,u = 1- Note that G' is a subgraph of G', so it is sufficient to show that 
G" is connected whp. 

Suppose that F = [Vp, Ep) is a A;-connected graph without 2/i;-strongly connected components for some 
k. Recall from Lemma [T8] that Vp can be partitioned into log \Vp\ sets Wi, . . . , Wi^g lypi such that the degree 
of any u € Wr in Wr U • • • U Wjog \Vp\ is at most A.k. For each node u G Wr let Eu denote the edges incident 
on u that go to nodes in Wr U • • • U Wjog \yp\ (if an edge e = [u, v) goes between two nodes in Wj., include 
it either in E^ or E^ arbitrarily). We will say vertex u G Wr controls edges e € Eu. Note that \Eu\ < 4/c 
for all u G Wr, and hence the expected number of edges sampled in Eu is at most 47 log n. Note that this 
definition of control is slightly different from the one given in Definition [23] In particular, this is because 
Definition |23]pertains to the actual sampling procedure that our algorithm uses, while here we are concerned 
with the estimation step. 

Let jmax = LI0S2 "^J- We will show by induction on j = jmax, • • • , that all 2-^ -connected components 
are connected with probability at least 1 — {jmax — j + l)n~^. 

Baserj = jmax We have k = 2-^™"^. Apply the decomposition above to the K-strongly connected components 
of G, which does not have any 2K-connected components since 2k > n. Let G" denote the subgraph 
of G obtained by including for each u G V edges e = (n, v) € Eu when X„ „ = 1. Denote the set of 
sampled edges by E'. Recall that for all u G V{G) one has E[\E' D Eu\] < 47 log n. 

Fix a cut (C, V \ G). For each vertex n G C let X„ = Y.{u,v)GEn,v^c ^u,v 
By setting e = 1 in Theorem |25] we get 

Pr[X„ = 0] < e-'^l^-l/^. 



g-E[X„]/3 ^g-7|C|logn/(3fc)_ 



Since X^, X^' are independent for u 7^ u' , the probability that the cut is empty is at most 

n 

u&C 

By Karger's cut counting lemma, the number of cuts of value at most ak is at most n^". A union bound 
over all cuts, we get failure probability at most 

^^2ag-7alogn/3 <^-4 

since 7 > 20. Taking a union bound over all K-connected components yields failure probability at most 



Inductive step: j + 1 — > j We have k = 2-'. By the inductive hypothesis, all 2-' "'"^-connected components 
will be connected with probability at least 1 — {jmax — {j + 1) + l)n~^. We condition on this event 
and contract the connected components into supernodes. 

We now have a union of vertex-disjoint K-strongly connected components that do not contain any 2k- 
connected components. The same ai^gument as in the base case shows that each such component will be 



connected with probability at least 1 
the inductive step. 



n .A union bound over at most n such components completes 



D 



In order to show that the results of IIFHHPllI carry over to our setting, it is sufficient to show that the 
following version of Chernoff bounds holds under our limited independence assumptions (Theorem 2.2 in 
OFHHPllH ^: 

Theorem 27. Let Xi, . . . , Xn be n random variables such that Xi takes value 1/pi with probability pi and 
otherwise.Then, for any p such that p < pi, for each i, any e € (0, 1) and any N > n the following holds: 



Pr 



E^^ 



n 



>€N 



<2e 



-0.38e^pN 



Indeed, an inspection of the proofs of Lemma 4. 1 and Lemma 5.5 in MFHHPllI shows that the authors (a) 
only rely on independence of their sampling process to obtain Theorem |27] and (b) only apply Theorem |27] to 
subsets of edges of G, where pi are sampling probabilities.Thus, proving an equivalent of Theorem |27] allows 
us to extend their results to our limited independence sampling approach. 

We now prove 

Lemma 28. Let G = {V, E) denote an unweighted undirected graph. Let 7 > be a sufficiently large 
constant such that sampling at rate 7 log 7i/ce independently produces a sparsifier with probability at least 
^, where Ce is the edge connectivity ofe. Let X^,, e £ E be random variables corresponding to including 



-n 



1 

edges from a set E* into the sample such that X^ takes value 1/pe with probability p^ and otherwise, where 
Pe is the sampling probability used by Algorithm^ Assume that sampling is c log n/e^-wise independent for 
a sufficiently large constant c > that may depend on 7. 



There exists an event £ with Vr[£] 
e€ (0,1) and any N > \E*\ 



> 1 



n 



such that for any E* C E, any p < Pe,e G E*, any 



Pr 



E^e 



\E*\ 



e&E* 



>eN\£ 



< e~ 



^pN/e 



Proof. For simplicity of exposition, we now assume that G is unweighted. Let Xi, . . . , X„ be random vari- 
ables con^esponding to picking edges of the graph. Recall that our sampling algorithm samples an edge (n, v) 
either depending on the value of g(^u,v) or the value of 5'(^, „) (the choice depends on the partition of the node 
set Ui U ... U Ur constructed in Algorithm |2]i. Recall that by Definition |23] a node u controls edge (n, v) 
if Algorithm |2] samples {u,v) using the value of gT^^A- Note that each edge is controlled by exactly one 
node. For a node u, as before, let E^ denote the set of edges controlled by u. By Lemma |24j one has 
E[|£'* n £^1(1] = 0(log^n/e^). Let £ denote the event that at most 27log^n/e^ edges controlled by u are 
sampled, for all u £ V, where we are assuming that 7 is sufficiently large. Since our random variables are 
clog^ n/e^-wise independent for sufficiently large c, by Theorem |25] and a union bound over all u one has 
Pr[£] > 1 - n-2. 

For each e G E let X^ be a Bernoulli random variable that takes value p/pe if edge e is sampled, and 
otherwise, so that Xe € [0, 1]. 



10 



Consider a set of edges E*. Paitition E* as E* = {J^^yE^, where E^ = E* f] Eu- Thus, random 
variables ^„ := YleeE* ^e are independent for different i. Let X = Yluev '^'^^ /^ ~ '^\'^\ 
Then by Markov's inequahty 



Pr[^>(l + 5)H^]<^1^. 



(4) 



Recall that 



n^fi^ 



.eGS 



(5) 



i=o i=o sc£;Msi<jae>o,Eees"==J 

For any non-negative random variably 3^ one has 

Conditional on E, one has HegS ^"" = ^ ^o"" ^^^ S C E* such that jS n £^„| > 27 log^ n/e^ for at least 
one u eV. For other S, setting 3^ = Yiees -^e^'' ^^^^ S^^ 



E 



U^e'\£ 



.eeS 



<E 



.ee5 



/Fr[£] 



(6) 



Combining Q and dS]) one gets 

-. 00 



Pr[f] 



j=o 



E E E 

SC£;MS|<i,|5n£„|<27log'in/e2ae>0,Eees"<!=i 



.eeS 



(V) 



On the other hand, for all S (^ E* such that \S D Eu\ < 27 log'' n/e^ one has 



E 



,eGS 



ee5 



by 7 log^ n/e^-wise independence. Thus, we get 



Pr[A' > (1 + 6)fi\£] < 



1 negz;»E[e^-^-] 



(8) 



which is the same bound as in the full independence case, except for a factor of l/Pr[<S] = 1 + 0{l/n) in 
front. Now the same derivation as in the full independence case shows that the probability of overestimating 
is appropriately small. 

We now bound the probability of underestimating. Consider a set of edges E* . Partition E* as E* = 
Ui=i Ei, where Ei nEj = 9,i^ j, so that 

1. E[^gg£. Xe] < clog^ n/e^ for a sufficiently large constant c > 0; 

2. random variables Y2eeE -^e are independent for different i; 



3. s< 



61og(4/e2) 



EEeeE.^e]. 
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Note that this is feasible since our graphs are unweighted, so e can be assumed to be larger than 1/n^. For each 
i = 1, . . . , s let ^j := X^gg^; Xe- Note that Xi are independent, and Xg are clog^ n/e^-wise independent. 
Now by Theorem I25] for alH = 1, . . . , s one has for all e € (0, 1) 



Let X = Y.i=i ^i- For constant e > let 



(9) 



K.{e) = < z = {zi,Z2,...,Zs 
We now have 



^[0,-/,le\le^...A-e'/4,iy:J2zMX,]>enA 



Pr [A" < E [X] - eE[£]] < ^ llPr[Xi < E[^,] - {zi - e^ifEiXi]] 

zelCie)i=l 
s 

< J2 Yl^r[X,<Em-zfE[X,] + {e^/2)Em] 

zG/C(e)i=l 



(10) 



since every set of values for Xi — E[Xi] such that Ylii'^i ~ ^i'^i]) < — eE[A'] can be rounded to a point 
in /C(e) with a loss of at most (e^/2)E[A'j] in each term. We now note that for any z G [0, 1]'^ such that 
J2t=i Zi^^i] = e'EiA"] > eE[X] one has J2Ui ^M^i] > W?H^] > e^E[^]. 



Next, since s < 



f(Ei=iE['^'i])> we have that 



61og(4/e^)' 



Theorem 29. The set of edges returned by Algorithm^is a sparsifier whp. 
Proof. Lemma |28]can be used instead of Theorem [27] in BFHHPllH . 



(11) 
D 
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